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Abstract 

ALICE is a unique opportunity for studying low pt physics and minimum bias 
events, and consequently for hunting substructures in strong interactions. The gen- 
eral question concerning global event properties in pp and pp physics is indeed 
whether substructures can be seen in the data in a model-independent way. In 
other terms, can we identify the interface between perturbative and non-perturbative 
regimes? 
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1 Introduction 

ALICE is a unique opportunity for studying low pi physics and minimum bias events, and 
consequently for hunting substructures in strong interactions. The general question concerning 
global event properties in pp and pp physics is indeed whether substructures can be seen in 
the data in a model-independent way. In other terms, can we identify the interface between 
perturbative and non-perturbative regimes? 

A successful approach to this question has recently emerged [|l]l, as will be shown in the 
following. 



2 A summary of substructure hunt in pp collisions 

What is known on the subject comes from data taken at ISR, at SppS collider (UAl, UA2 
and UA5 experiments) and the Tevatron collider (CDF and E735 experiments) as well as from 
dedicated theoretical work. 

2. 1 Energy widening of multiplicity distributions. 

It is to be stressed that recent results from Tevatron (E735 Collaboration, [^) on full phase space 
multiplicity distributions do not completely agree with those obtained at comparable energies 
at the SppS collider (UA5 Collaboration, ^ ^), see Fig. [T]. Tevatron data are more precise than 
SppS data at larger multiplicities (they have larger statistics and extend to larger multiplicities 
than UA5 data), but much less precise at low multiplicity. Both sets of data show a shoulder 
structure, but the Tevatron MD is somewhat wider. It should be noticed that E735 data are 
measured only in \ri\ < 3.25 and pt > 0.2 GeV/c then extended to full phase space via a Monte 
Carlo program. 

A standing problem! 

2.2 The fit with two negative binomial distributions. 

The negative binomial (NB) behaviour for final charged particles multiplicity distributions can 
be trusted in hadron-hadron collisions in full phase space only up to ISR energies [^. At higher 
energies shoulder structures start to be clearly visible as shown by the UA5 Collaboration at 
CERN pp collider The idea firstly suggested by C. Fuglesang [Q] is to explain observed 
NB regularity violations as the effect of the weighted superposition of two classes of events, the 
multiplicity distribution of each component being of NB type: 

-Pn ( Cl^soft 1 '^soft; ^softi '^semi-hard) ^semi-hard) Q^soft-fji ('^soft; ^soft) ~l~ (1 Oisoft) P-n ('^semi-hard; ^semi-hard)) 

(1) 

where, for each class, n is the average multiplicity and parameter k is linked to the dispersion 
D by = D'^/ (n)^ — 1/ {n). The two classes are interpreted as soft events (events without 
mini-jets) and semi-hard events (events with mini-jets) and consequently the weight asott is 
the fraction of events without mini-jets as measured by UAl [0]. The conclusion is that the 
proposed fit in terms of the superposition of two negative binomial multiplicity distributions 
(NBMD's) is quite good. 
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Figure 1: E735 results on charged particle multiplicity distributions in full phase space compared with UA5 results 
[pp. Data from the two experiments which were taken at nearly the same energy are rescaled by the same factor. 



2.3 Extrapolations to LHC energy. 

The point is to find acceptable energy dependence of the NB parameters k and n for the two 
components substructures and the corresponding weight factor ctsoft- In a region where QCD 
has no predictions one must proceed by phenomenological assumptions [||, which are sum- 
marised in the following. 

The first assumption concerns energy dependence of the total average charged particle mul- 
tiplicity [§: 



n = 3.01 - 0.474 In + 0.745 In^ v^. 



(2) 



The In s term is interpreted as the effect of the sharp increase of minijet production and/or 
double (even triple) parton collisions as suggested by the rapid increase of (pt) with multiplicity. 
Since below 200 GeV cm. energy one single NB describes multiplicity distribution data very 
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Figure 2: Average charged multiplicity in full phase space and in three pseudo-rapidity interval vs. cm. energy. 
Experimental data shown refer to non-single-diffractive data. The solid line shows Eq. (||), the dashed and dotted 

lines are described in the text. 



well and above 200 GeV cm. energy the soft component has been disentangled, it is proposed 
to extrapolate the logarithmic increase with energy of the average charged particle multiplicity 
of the soft component, n^ofu also at higher cm. energy. From n and Usofu one can calculate the 
behaviour of ttsoft, which, using the UAl result ^semi-hard ~ '^n^ofu gives asoft ~ at 100 TeV, 
but, if one allows again a In^ s term in ^semi-hard (with a small coefficient, of the order of 0.1, it 
is also consistent with the data) one still gets at that energy a sizable (20%) contribution from 
soft events. 

The second assumption concerns the energy dependence of the NB parameter k. For the 
soft component, KNO scaling was assumed, since it was found to hold in the ISR region. For 
the semi-hard components three scenarios were examined, two extreme ones (scenario 1 in 
which the semi-hard component also obeys KNO scaling and scenario 2 in which the semi-hard 
component violates KNO scaling with 1/k growing linearly with In s) and an intermediate case 
(scenario 3, where 1/k grows not as fast as linearly with a shape inspired by pQCD calculation). 

The behaviour of n in the TeV region based on UA5 data (Eq. @) is shown in Fig. |] 
(solid line). It is compared with the mentioned E735 results obtained by extrapolating to 
full phase space experimental data with a Monte Carlo calculation; observed discrepancies are 
clearly consequences of the differences noticed between E735 results and UA5 data on charged 
particle multiplicity distributions shown in Fig. |T] and discussed in Section |2?T| . Accordingly, 
one relies on Eq. @ predictions. 

Coming to particle rapidity density it should be pointed out that by assuming only a longi- 
tudinal growth of phase space and constant height of the rapidity plateau with cm. energy for 
semi-hard events, as done in Ref. [|^], CDF data [ p^ in pseudorapidity intervals are underes- 
timated (see Fig. 0, dashed lines). These data are well described by allowing a In^ s growth 
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Figure 3: Predicted charged particle multiplicity distributions in the two component model for the interval 
1 77 1 < 1 at Tevatron and LHC energies, where the two components (dashed lines) are scaled by cksoft and 1 — ctsoft 
respectively, Eq. ([l|), with a,soft = 0.6 at 1800 GeV and 0.3 at 14 TeV. 
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Figure 4: CDF measurements on average transverse momentum vs multiplicity in the full sample and in different 
classes of events (with and without minijets, called resp. 'hard' and 'soft' in this figure) 1 1 1 1 



of the total rapidity plateau (Fig. dotted lines): from this consideration one deduces a more 
appropriate growth of the semi-hard plateau height; the constraint is that fisemi-hard in full phase 
space follows a logarithmic growth with -^i as discussed above. Predicted charged particle 
multiplicity distributions in the three scenarios of the two component model for the interval 
\r]\ < 1 at Tevatron and LHC energies, calculated for the last mentioned case, are shown in 
Fig. H (notice that the comparison with CDF data, in view of the relatively large value of their 
resolution pt > 0.4 GeV/c, is questionable). If this behaviour for the semi-hard component will 
be confirmed by data one should conclude that semi-hard events populate mainly the central 
rapidity region giving an important contribution to the increase of charged particle density in 
central rapidity intervals. 



2.4 Soft and hard samples at Tevatron. 



It was found by CDF [ |12| ] that by subdividing the minimum bias sample into two groups, char- 
acterised respectively by the absence ('soft' events) or the presence ('hard' events) of mini-jets, 
interesting features of the reaction can be investigated. More precisely, a 'hard' event has been 
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defined as an event with at least one calorimeter cluster in \ri\ < 2.4, a cluster being defined 
as a seed calorimeter tower with at least 1 GeV transverse energy Et plus at least one contigu- 
ous tower with Et > 0.1 GeV. A subdivision which is interesting per se and can be tested at 
14 TeV. 

In summary, the soft component is found to satisfy KNO scaling (as expected in [^), while 
the hard one does not; also the {pt) distribution scales at fixed multiplicity in the soft component 
and not in the hard one; the dispersion of (pt) vs. the inverse of the multiplicity is compatible 
with an extrapolation to as n — cx) in the soft component but not in the hard one, indicating 
in this limit a lack of correlations in the soft component. 

The correlation between (pt) and multiplicity was explained by UAl JlBl ] as related to the 
onset of gluon radiation. It should be noticed that at CDF such a correlation is found to some 
extent for both the soft and the hard subsamples as shown in Fig. but the soft subsample is 
seen to start to saturate (CDF data shown have \r]\ < 1.0 and pt > 0.4 GeV/c). 



2.5 Low and high transverse momentum at SppS. 

An investigation has been recently carried out [jT^] on UAl data also based on the superpo- 
sition idea: particles are selected according to their pt (this selection should be contrasted 
with that performed by CDF in terms of classes of events). It is seen that the high-pt sam- 
ple (pt > 0.7 GeV/c) behaves very differently from the low-pt sample. This difference is 
interpreted as the effect of a more intense jet-like activity in the high-pt sample. As far as the 
low-pt sample is concerned, it was shown that the dependence of the correlation strength and of 
higher order cumulants on multiplicity is important in order to test different theoretical models 
(Monte Carlos are totally inadequate here): for this task the low-pt cut-off of ALICE at LHC is 
required. In addition, it would be very interesting to analyse in this way the soft and semi-hard 
components. 



2. 6 Average transverse momentum versus multiplicity. 



The study should be mentioned at Tevatron (E735) [ [151 ] of the correlation between (pt) and 
multiplicity done separately for pion, kaons and antiprotons, which shows that the behaviour 
is rather different, as illustrated in Fig. |5|, and still not theoretically understood. Data show 
saturation in pt at large multiplicity for tx^, not for and and p (but for kaons it saturates 
if a cut is imposed on momenta pt < 1-5 GeV/c, see [|16]1). At LHC, with 10^ events, the 
predictions discussed in Section ^3] allow the possibility to reach densities from 40 (scenario 
1) to 60 (scenario 2) particles per unit rapidity, with good statistics (1000 events). This fact has 
far reaching consequences which can be tested with ALICE especially at low momentum and 
the study can be extended to baryon production in the central region (Alice can also measure 
A). All this, plus the relative abundance of particle species, carries precious information on 
possible quark-gluon plasma formation, in particular in view of such properties as strangeness 
enhancement and baryon stopping (see elsewhere in this Chapter). 



3 Investigations in pp collisions with ALICE 
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Figure 5: Measurements by E735 [ |15[ | of average pt for identified particles vs. cm. energy and vs. particle density. 
The relative abundance of particles is also shown as a function of particle density. 



3.1 Shape of the multiplicity distribution. 

The average multiplicity {n) grows with the cm. energy y/s: the best fit involves a term pro- 
portional to In^ s, whose theoretical basis is not yet understood. Most theoretical works predict 
in fact a power-law in s or a linear rise with In s. Is LHC energy large enough to distinguish 
these behaviours? Purely statistical extrapolation based on the above mentioned scenarios (Sec- 
tion |Z^) show that the average multiplicity in full phase space can be measured with 0.2% error 
with 10^ events. 

The ratio D/ {n), where D is the dispersion, is constant if KNO scaling holds. KNO scaling 
is an asymptotic prediction: data at ISR energies are compatible with an 'early' KNO scaling, 
but SppS data clearly showed a violation. Purely statistical extrapolation based on the above 
mentioned scenarios shows that the variance can be measured with less than 1% error with 
10^ events both in full phase space and within the ALICE acceptance. 



3.2 Shape fits. 

It will be possible to verify the extent of forward-backward (FB) correlations seen at SppS and 
at Tevatron, to distinguish whether FB correlations grow or decrease and to study their link with 



theMD n. 



Predictions exist for the multiplicity distribution at 14 TeV, (e.g. ^ and [[TSp; an al- 
ternative point of view on the two-component structure based on impact parameter analysis is 
presented in [p^). As previously explained in Section ^3) , in [|S|, |^] three scenarios can be dis- 
tinguished at LHC by the value of k^^ ^ D'^/ (n)^. It turns out that both in full phase space 
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and in restricted pseudo-rapidity regions the difference between the scenarios can be sharply 
defined if D"^ / {n)"^ is measured to 15% accuracy, as seen from the following table: 











scenario 


f.p.s. 


77 < 1.5 


< 1 


1 


0.17 


0.37 


0.39 


2 


0.42 


0.74 


0.78 


3 


0.24 


0.54 


0.56 



Purely statistical extrapolation shows that in these scenarios 10^ events will yield 0.5% error, 
which is expected to be good enough to give extremely relevant information even if none of the 
envisaged scenarios turns out to be adequate (this would be a strong indication of the presence 
of an anomalous additional (hard?) component). 

The scenarios and predictions should be confirmed not only by the analysis of the first two 
moments: the whole distribution should be involved. Purely statistical extrapolation shows that 
data produced in one scenario cannot be adequately fitted by either one of the other scenarios 
already with 10^ events. Also fits based on one component only (of NB type) can be excluded 
with such statistics, as well as the other predictions. 

Again on the global side, the information entropy S = — P„ log P„ can be calculated 
and compared with the behaviour assumed in and the one implicit in [H, ^. Purely statisti- 
cal extrapolations based on the latter reference show that information entropy can be measured 
to better than 0.1%, both in full phase space and in |?7| < 1, with 10^ events, which is more 
than enough to distinguish again the mentioned scenarios among them and from the expecta- 
tion of a scaling with available phase space. Incidentally, the latter option would produce 
a correction to (n) of a few percent, thus perfectly identifiable by LHC already with 10^ events 
(ignoring again systematic errors). 

Feynman scaling (dN/dy around y = independent of ^i, where y is the rapidity), intro- 
duced initially at ISR energies, has been shown to be violated at higher energies: as discussed 
previously in Section ^3] , the violation is due not only to the increasing importance of semi-hard 
events, but one expects a growth also of the semi-hard plateau; this should be checked at LHC. 

The Ed^N/dp^ vs. pt distribution is interesting because it can be compared with results 
from Tevatron and SppS. Is it possible to identify two (or more) components (e.g., different 
slopes) in this distribution, even if the soft events have a much smaller pt than the hard ones? 
The CDF collaboration has shown that the single event's average pt is a good candidate for 
event classification, especially in combination with the event multiplicity (see again Fig. 
Multiplicity classes can be defined and whole distributions (e.g. in transverse momentum) can 
be looked at as a function of multiplicity or particle density [ [T2| ] . Very interesting information 
for investigating the mechanism of particle production can come from the study of identified 
particles. 



3.3 Sign oscillations of higher order moments. 

Hq moments (the ratio of factorial, Fq, to cumulant, Kq, moments) are very important in defin- 
ing substructures in the multiplicity distribution pT}], as experimentally measured by L3 at LEP 
[p2|], but require very large statistics in order to be measured. Purely statistical extrapolations 
show that, in \ri\ < 1, 10^ events allow to confidently calculate Hq up to order q = 7, but the 
expected 10^ events, will allow to reach at least order q = 14, quite adequate for the purpose. 
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In fact Hq vs. q oscillations in hadron-hadron collisions in the GeV region [El|, Rl] and in e+e 



annihilation at LEP energies p20 have been explained as the effect of the weighted superpo- 
sition of classes of different topology, each class being described by a NBMD with different 
parameters. Observed shoulder structure in final charged particle multiplicity distributions and 
Hq\s. q oscillations have in this framework the same origin. In addition being Hg the ratio of 
factorial to cumulant moments, Fg/Kg, the occurrence of the NBMD for a sound description 
of Hg behaviour in each substructure via generalised local parton-hadron duality (GLPHD), 
i.e. the statement that Fg at hadron level is equal to p'^Fg at parton level, with p defined by 
^^hadron = P^partonj Icads to the conclusion that Hg at hadron level coincide with Hg at parton 
level: an interesting possibility to understand deeply QCD and/or GLPHD. 



3.4 Underlying event for Higgs production. 



As Bjorken pointed out pSQ, at LHC energy processes will happen characterised by the pres- 
ence of virtual electroweak bosons in the hard subprocess, like WW scattering via W or even 
Higgs exchange, with the bosons treated as partons of the proton beam. If the W's then decay 
leptonically, a feature of the event will be a large rapidity gap, i.e., a region without hadrons 
separating the beam-jets containing the fragments of the projectiles. To turn this into a reliable 
signature, several issues were raised and discussed in [^: one of them demands to know how 
big the rapidity gap must be in order that multiplicity fluctuations do not mimic the effect of W 
scattering and decay. To this question a detailed answer can be given by studying low multi- 
plicity events: the scenarios previously described foresee 25 to 35% of events with less than 4 
charged particles in the central region |?7| < 1.5, which seems a considerable background to the 
W and Higgs events 

One should also not forget that the 'void probability' Pq, i.e., the probability of producing 
zero charged particles in a given rapidity interval, has interesting properties on its own, since in 



principle its n dependence determines the full MD [24]. 



3.5 Quark-gluon plasma. 

It has been argued that the increase of (pt) from ISR to SppS energies and its subsequent flatten- 
ing with multiplicity could be an indication of a phase transition; this view has been abandoned 
and data analysed later in terms of phase space constraints and the emergence of mini-jets. 
CDF analysis (see Fig. ^ of {pt) vs. central rapidity particle density, dN/dy, at 630 GeV and 
1800 GeV reveals a different behaviour for soft and semi-hard events: a saturation is seen in 
soft events at (pt) ~ 0.5 GeV/c, a fact to be contrasted with the increase from ^ 0.44 up to 
~ 0.7 GeV/c for (semi)hard events. In view of the high rapidity density expected with Alice, 
two important questions should be asked: a) does saturation effect for soft events continue up 
to 14 TeV, or, at a given particle rapidity density, (pt) starts to increase again? In the latter 
case, could the sharp increase be indicative of a deconfinement transition in hot hadronic matter 
[P5|]? b) does (pt) for semi-hard events continue to increase up to 14 TeV, or at a given particle 
rapidity density a saturation effect appears? One should expect that hard events at some particle 
rapidity density start to appear on top of the semi-hard ones in the first case, and the possible 
occurrence of a phase transition in the second one. 
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